Finding optimal threshold for correction error reads in DNA assembling
نویسندگان
چکیده
منابع مشابه
Optimizing error correction of RNAseq reads
Motivation: The correction of sequencing errors contained in Illumina reads derived from genomic DNA is a common pre-processing step in many de novo genome assembly pipelines, and has been shown to improved the quality of resultant assemblies. In contrast, the correction of errors in transcriptome sequence data is much less common, but can potentially yield similar improvements in mapping and a...
متن کاملAn Improved Algorithm for Error Correction of Reads in DNA Fragment Assembly
Most large-scale genome sequencing projects use the whole-genome shotgun sequencing strategy, in which a genome is shattered into numerous small fragments and the fragments are then sequenced from both ends. The resulting sequences (called fragments or reads) must then be assembled to reconstruct the chromosomes of the genome. Current technology produces reads of length 600-800 base pairs (bp) ...
متن کاملPREMIER - PRobabilistic error-correction using Markov inference in errored reads
THIS PAPER IS ELIGIBLE FOR THE STUDENT PAPER AWARD. In this work we present a flexible, probabilistic and reference-free method of error correction for high throughput DNA sequencing data. The key is to exploit the high coverage of sequencing data and model short sequence outputs as independent realizations of a Hidden Markov Model (HMM). We pose the problem of error correction of reads as one ...
متن کاملError correction and assembly complexity of single molecule sequencing reads
Third generation single molecule sequencing technology is poised to revolutionize genomics by enabling the sequencing of long, individual molecules of DNA and RNA. These technologies now routinely produce reads exceeding 5,000 basepairs, and can achieve reads as long as 50,000 basepairs. Here we evaluate the limits of single molecule sequencing by assessing the impact of long read sequencing in...
متن کاملError Correction in Dna Computing
We present a method of transforming an extract-based DNA computation that is error-prone into one that is relatively error-free. These improvements in error rates are achieved without the supposition of any improvements in the reliability of the underlying laboratory techniques. We assume that only two types of errors are possible: a DNA strand may be incorrectly processed or it may be lost ent...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2009
ISSN: 1471-2105
DOI: 10.1186/1471-2105-10-s1-s15